Further refinements #18

Merged
samnooij merged 11 commits into main from dev on Aug 29, 2025

@samnooij
Member

General workflow refinement, including code updates and expansion of the documentation.

samnooij and others added 8 commits August 27, 2025 16:32
Update Spacepharer to point at the latest version
and make sure each YAML file neatly ends with a newline character.
- Update the index, removing old entries and adding new ones.
- Fill in links to output file documentation.
- Move link to the user manual to the top.
- Include MLST in documentation

- Tick box in to do
- Add output to 'rule all'
- Include rules to concatenate output in one file
- Include PADLOC documentation
- Reference additional functions in index
- Add 'extra functions' and 'output files' entries
Correct the output directory and parsing script. These have changed slightly in the updated Jaeger version.
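
One of the commit notes above mentions making sure each YAML file ends with a newline character. A minimal sketch of how such a check could be automated is shown below; the function names and the idea of scanning a directory tree are assumptions for illustration, not part of this pull request.

```python
from pathlib import Path


def ensure_trailing_newline(path: Path) -> bool:
    """Append a newline to `path` if its last byte is not one.

    Returns True when the file was modified.
    """
    data = path.read_bytes()
    if not data or data.endswith(b"\n"):
        # Empty files and files that already end with a newline are left alone.
        return False
    path.write_bytes(data + b"\n")
    return True


def fix_yaml_files(root: Path) -> list[Path]:
    """Fix every *.yaml / *.yml file under `root`; return the files that changed."""
    changed = []
    for pattern in ("*.yaml", "*.yml"):
        for path in sorted(root.rglob(pattern)):
            if ensure_trailing_newline(path):
                changed.append(path)
    return changed
```

Calling `fix_yaml_files(Path("envs"))` (a hypothetical directory name) would return the list of files that were missing their final newline, so the function can also serve as a check before committing.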
@samnooij requested a review from @Necopy-byte on August 29, 2025 10:13
Collaborator

@Necopy-byte left a comment

Looks good. There are some minor spelling errors, and I remembered that Phagescope does not need to take up 320GB, as Spacepharer should be able to accept it gzipped.

Comment thread README.md Outdated
Comment thread README.md Outdated
Comment thread doc/dev_notes.md Outdated

The Phagescope database says that it can filter genomes based on criteria,
but actually downloading these fastas is impossible due to an error.
Additionally, `wget` and `curl` do not properly download the databases in a way that
Collaborator

This has been solved with the script I wrote. Apparently the server was just down when I originally tried to download it via wget.

Comment thread doc/dev_notes.md Outdated
page cannot be solved using the recommended 'strict channel priority' setting.
We could only get it to work with the flexible or disabled channel priority.
We have adapted the YAML so that it can be solved using strict priority mode,
but this does make strand prediction in CRISPRidentify non-functional.
Collaborator

Strand prediction and, apparently, also the optional cas prediction module CRISPRcasidentifier.

Comment thread doc/manual.md
- [PLSDB](spacepharer.md#plsdb-2024_05_31_v2): ~80GB

- [Phagescope](spacepharer.md#phagescope): ~320GB
Collaborator

The file size does remind me that Spacepharer should be able to accept gzipped files for its database creation. I will make an issue for this.
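
To illustrate the general idea behind the comment above (reading a compressed FASTA file directly instead of storing it uncompressed), here is a minimal Python sketch. It does not show or verify how Spacepharer itself handles gzipped input; the helper names `open_maybe_gzipped` and `count_fasta_records` are hypothetical and only demonstrate that a script can treat plain and gzipped FASTA files the same way.

```python
import gzip
from pathlib import Path
from typing import TextIO


def open_maybe_gzipped(path: Path) -> TextIO:
    """Open a text file, decompressing on the fly if it is gzip-compressed.

    Detection uses the two gzip magic bytes rather than the file extension.
    """
    with open(path, "rb") as handle:
        is_gzip = handle.read(2) == b"\x1f\x8b"
    if is_gzip:
        return gzip.open(path, "rt")
    return open(path, "rt")


def count_fasta_records(path: Path) -> int:
    """Count FASTA records by counting header lines that start with '>'."""
    with open_maybe_gzipped(path) as handle:
        return sum(1 for line in handle if line.startswith(">"))
```

With this approach the compressed database stays on disk as-is, and only the stream being read is decompressed in memory.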

samnooij and others added 3 commits August 29, 2025 14:04
Co-authored-by: Necopy-byte <g.s.a.vanwaltmeijer@students.uu.nl>
Co-authored-by: Necopy-byte <g.s.a.vanwaltmeijer@students.uu.nl>
@samnooij
Member Author

Thanks for checking and suggesting fixes, @Necopy-byte!

@samnooij samnooij merged commit 99eed6b into main Aug 29, 2025
1 check passed

Development

Successfully merging this pull request may close these issues.

- Rule 'collect_jaeger_batch' fails on test dataset
- Rule collect_jaeger_predictions appears to get stuck

2 participants